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[57] ABSTRACT 

^ A data processing system and method for selecting securities 
' and constructing an investment portfolio is based on a set of 
artificial neural networks which arc designed to model and 
track the performance of each security in a given capital 
market and output a parameter which is related to the 
expected risk adjusted return for the security. Each artificial 
neural network is trained using a number of fundamental and 
price and volume history input parameters about the security 
and die underlying index. The system combines the expected 
return/appreciation potential data for each security via an 
optimization process to construct an investment portfolio 
which satisfies predetermined aggregate statistics. The data 
processing system receives input from the capital market 
and perioditiily evaluates the performance of the invest- 
ment portfolio, rebalancing it whenever necessary to correct 
performance degradarions. 

23 Claims, 7 Drawing Sheets 
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PREDICTIVE NEURAL NETWORK MEANS 
AND METHOD FOR SELECTING A 
PORTFOLIO OF SECURITIES WHEREIN 
EACH NETWORK HAS BEEN TRAINED 
USING DATA RELATING TO A 
CORRESPONDING SECURITY 

FIELD OF INVENTION 

The present invention relates to a financial data process- 
ing system and method designed for investors whose objec- 
tive is to obtain a return on their investment portfolio which 
is superior to the broad index benchmarks of a given capital 
market More particulariy, the system and mrthod of this 
invention arc preferably carried out using artificial neural 
oetworics capable of estimating the ^>preciation potential of 
the individual securities in a capital market. The system of 
this invention uses the appreciation estimates for the indi- 
vidual securities to create and administer investment port- 
folios with varying time horizons. 

BACKGROUND OF THE INVENTION 

Individual and institutiooal investors apply various selec- 
tion and optimization strategies in the process of aeating a 
securities portfolio which can provide higher return on their 
capital investment Such strategies are frequently based on 
research in a particular industry, historical price data for 
individual stocks, analysis of fundamental data. etc. A com- 
mon problem in formulating an investment strategy is. 
however, the overwhelming amount of the available maricet 
Information which cannot be readily interpreted. For this 
reason, investors are confined to specific techniques which 
rely on limited subsets of data. 

For example, many investors employ a technique known 
as market timing, which involves investing in die equity 
markets at the perceived time of market growth and divest- 
ing at a later time of perceived market contraction. This 
strategy is usually based on timing the cycles for the market 
as a whole which tends to avoid the risk associated with 
owning individual stocks. A problem with this approach is 
dial accurate timing of the market cycles involves, by 
necessity, the compression of large amounts of statistical 
data into analytical worksheets, graphs, bar charts or other 
quantitative depictions of the information so as to enable a 
skilled analyst to make an investment decision. 
Unfortunately, the compression process may obscure rel- 
evant trends in smaUcr subsets of data. Further problems 
arise if it is desired to follow the performance of foreign 
markets because additional information such as currency 
exchange rates has to be taken into account. 

The advent of computerized trading and other fOTms of 
advanced information processing in recent years has created 
a new family of global investment products, such as com- 
modity options; real estate and currency funds; financial 
futures contracts and other derivative instruments based on 
the underlying securities. Also, investment v^cles have 
been offered to track the performance of select foreign 
markets. A problem common to all these investment prod- 
ucts is that in order to obtain desirable return rates, it is 
required to use specialized knowledge in the respective 
investment fields. 

Among the various investment options, mutual funds 
which offer a variety of investment options tailored to 
specific customer needs have gained popularity in last few 
years. One such option is presented by the so-called indexed 
stock funds which are designed to track the performance of 
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broader market benchmarks, such as the S&P (Standard & 
Poor's) 500 index. The S&P 500 index is a market capitali- 
zation weighted basket of the stocks of 500 large companies, 
indicative of the performance of the U.S. equity markets. 
5 The indexed stock funds usually invest in the stocks of the 
S&P 500 companies targeting their relative weights in the 
index and, therefore, directly track the performance of the 
S&P 500 index, aided by lower asset research fees and 
transaction costs. Such financial products offer convenience 

10 and relatively low fees but are meant to follow and match the 
market, not ouq>erform it 

If the investment goal is to surpass the underlying market 
benchmark, the task becomes one of holding a smaller 
subset of all stoc^ of the nurket, such that this subset has 
a higher expected return and about the same level of risk as 
the market index. Such task requires one to focus on 
individual stocks and their performanoc in relation to the 
index that serves as ttie underiying performance benchmark. 
Individual stocks usually have their own unique peifor- 

^ manoe characteristics. Information that is specific to a par- 
ticular stock includes earnings-related data, the company's 
growth plans, personnel changes and the public's perception 
of the company. While some of these factors (sudi as 
personnel) are not readily quantifiable, others such as eam- 

^ ings and projections (e.g., earnings estimates and tiieir 
revisions) can be quantified. Qearly. however, the relation- 
ships among such data are complicated and frequently 
non-linear, making them difficult to analyze. In sununary. an 
investment decision in the modem capital markets requires 

^ processing of large volumes of data and taking into account 
a number of factors which may exhibit significant non-linear 
relationships among different components of the data. 

Computers, in genaal, are very adept at dealing with large 
amounts of numerical information. However, sophisticated 
techniques are required to analyze and combine disparate 
information that can potentially impact security prices. Sev- 
eral expert conputer systems have been deployed in the 
domain of finance, including some in the area of investment 
management. 

In the past several years, neural networks ("neural nets") 
have become popular in solving a variety of problems 
ranging from protein secondary structure prediction to 
speech recognition. Neural nets mimic the ability of the 
human brain to recognize recurring patterns that are not just 
identical but similar. A neural net can iM-edict the value of 
one variable based on input from several other variables that 
can in^»act it. The prediction is made on the basis of an 
inventory of patterns previously learned, seeking the most 

^ relevant in a particular situation. Unlike traditional Artificial 
Intelligence (AI) systems, whidi are rule-based, neural nets 
can ^Icam*" by exanq^le and modify themselves adjusting 
and adaf^g to dianging conditions. 
Several applications of neural nets to the domain of 

55 finance are already known in the art. 

U.S, Pat No. 5,109,475 to Kosaka ct al. discloses a neural 
network for selection of time series data. This method is 
illustrated in a particular application to the problem of stock 
portfolio selection. In the first step of the proposed method, 

60 certain characteristics for each security are calculated from 
a time series data related to the security. The diaracteristics 
to be computed include the historical risk (variance and 
co-varianoe) and the return. The following step involves the 
establishment of a parformance function based on the cal- 

65 culated characteristics and, in the third step of the method, 
a Hopfield neural network is used to select a subset of 
securities from a predefined universe. Due to the fact that the 
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Kosaka system only considers historical risk and return data, 
and implicitly assumes that the relationship between risk and 
return factors will remain stable in the future, in a typical 
rapidly changing market environment, it is unlikely to 
predict accurately price variations which are subject to 
complicated non-linear relationships. 

U.K Pat. application 2 253 081 A to Hatano et al. discloses 
a neural net for stock selection using only price data as input 
The main idea of the pressed system is to calculate runs 
(sequences) of price trends, increases and decreases, using a 
point-and-figure chart and using the maximum and mini- 
mum values from the chart to make a time- series prediction 
using a neural network. As in die previous case, the Hatano 
system only uses historic price data whidi places limitation 
on the type and quality of predictions that may be achieved. 
Additionally, the use of only the external points of the price 
chart obscures even further information about any time 
dependencies that might be present in the original data. 

The above-described financial systems do not fully utilize 
the potential of the neural nets for stock selection and 
optimized portfolio construction. Notably missing is the 
capability to take into account a combination of technical 
(historical price, volume, etc.) and fundamental factors 
(earnings, cashflow, etc.) for each security in a capital 
market Investment management systems known in the art 
have not been able to efiFectively combine technical and 
fundamental information in a computerized framework 
involving data processing systems and communication net- 
works. 

SUMMARY OF THE INVENTION 

It is accordingly an object of the present invention to 
provide a dau processing system to develop and administer 
an investment p<rfolio expected to outperform, if possible, 
broad indices associated with domestic <x foreign coital 
markets. 

It is another object of the present invention to present an 
artificial n^al network system for estimating Ae apprecia- 
tion potential of each security in a capital market using both 
fundamental and technical information about the security. 

It is yet another object of the present invention to develop 
a method for stock selection on the basis of appreciation 
potential parameters estimated using a neural network sys- 
tem for each stock in a given capital market 

Yet another object of the present invention is to provide a 
data processing system and method to create an investment 
portfolio subject to certain risk parameters and to rebalance 
the ponfoljo periodically to increase the expected return of 
the portfolio while maintaining the investment risk at a 
similar level. 

These and other objectives are achieved in accordance 
with the present invention using neural networks which can 
be trained to make predictions of the relative performance of 
a stock over a predetermined period of time vis-a-vis the risk 
adjusted return of the corresponding market index. The use 
of neural nets is particularly well suited to the problem of 
investment decision-making because it provides the capa- 
bility to capture non-linear functional relationships among 
input variables which are not easily modeled or captured by 
more traditional methods of security analysis and selection, 
such as multifactor models based on linear regression. 

Specifically, in the preferred embodiment, each stock in 
the capital market is tracked by an individual neural net 
which receives input signals representing both fundamental 
and technical information about the stock. Fundamental 
information includes earnings-related data at>out the stock 



iK442 

4 

while technical information includes price-derived data 
about the stock and the various market indices. The funda- 
mental and technical information about each stock* appro- 
priately organized, is used as a predictor or indicator of the 

5 future performance of the stock. (For example, one such 
indicator Is the volatility of the stock returns over a select 
period of time). The neural net is input a set of indicators 
conesponding to a snapshot in time. Each of these indicators 
is derived using a lookback period, tfius encoding a memory 

10 effect for each input. The necwcffk is trained using different 
sets of inputs (and corresponding outputs), tested on two 
different sets of data (termed test set and validation set) and 
then used to make forecasts using current market data. The 
type of indicators and the process of selecting values to be 

IS used as input data to the neural nets are determined by 
factors such as how far ahead the system is forecasting (its 
time horizon), the presence of periodicities in the data and 
the correlation b^een the indicator and ttie performance of 
the stock, the indices of the capital market, etc. 

^ Following the selection of input indicators, each neural 
net is trained with the available historical data. The training 
process continues until at least one stopping criterion is met 
Such criteria include the determination that the connections 
between the nodes of the net have reached a steady state, that 

25 the error between the predicted output and the actual target 
values is less than a certain threshold, or that a predefined 
time period has ele^sed without any improvement in the 
net's performance. Once the neural nets for each stock of the 
capital market have been trained and tested on the available 

^ historical data, in accordance with the preferred embodiment 
of the present invention, the neural nets are used to make a 
prediction of the appreciation potential of each stock. 

In a suitable time interval, such as one week, ttie appre^ 
ciation potentials of the stocks in the capital market are 
recon4>uted and the neural nets are trained again using the 
historical data plus the newly available data. Each week the 
trained networks estimate the appreciation potential of the 

stocks 1, 2 M weeks ahead, so that comparing earlier 

overlapping estimates to available data about die stocks can 

^ provide a measure of how good each appreciation estimate 
is. 

The system of the present invention correspondingly 
comprises means for choosing the appropriate input data 
which consists of a set of price-derived indicaton as well as 
earnings-based rankings; means fcx* pre-processing the data 
into appropriate categories and inputs; means for training the 
neural netwco^ks; means for determining the end of the 
training process; and means for accessing the appropriate 
^ data and making predictions for a given time period. 

The appreciation potential of each stock determined by 
the neural nets is used to generate an optimized investment 
portfolio comprising a selected number of stocks. Each 
stock is allocated a weight based on its estimated perfor- 
ms manoe potential within the prescribed future time period and 
other parameters related to the investment risk. These 
weights form the basis of t>oth long and short portfolios. The 
"long" portfolio includes a prespedfied number of stocks 
which are estimated to have the best appreciation potential 
60 Conversely, the "short** poitfolio includes stocks which are 
estimated to have the worst performance potential for the 
predetermined time period. An investor utilizing die system 
of the present invention may then make investment deci- 
sions (such as buying or selling stocks) on the basis of the 
65 suggested model portfolios. 

More specifically, the portfolio of stocks is created in 
acccvdancc with the present invention using a pratfolio 
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Optimizer which employs the pciformaoce potentials esti- 
mated by the neural nets, a measure of how good the 
estimates are. forecasts of economic and financial variables, 
risk allocation factors, investor jseferences and guidelines, 
and other factors. This optimizer is designed to suggest a 
portfolio of stocks which is expected to have similar risk as 
the overall capital market (<a: risk according to the investor's 
preference), but to ou^>crform thfc investment return of the 
associated market benchmarks over a predetermined period 
of time. 

The method and system of the present invention further 
provide continuous monitcring of the actual performance of 
the stocks of the capital market and comparison of this 
performance to the estimates obtained firom the neural nets. 
Should a discrepancy between die estimated and the actual 
return larger than a predetermined threshold be detected, the 
system can rebalance the con^sition of the determined 
model portfolio, so that the investors using the system can 
adjust their positions in the capital market Similarly, the 
portfolio can be rebalanced if the neural nets indicate 
different stocks with significantly higher or lower apprecia- 
tion potentials (based on newer data) over the relevant time 
period. 

The method and system of the present invention provide 
an effective and efiScient tool for optimizing investment 
decisions by utilizing the c^ability of neural nets to take 
into account complex relationships among different market 
and stock-specific factors. The method and system further 
incorporate data processing optimization techniques that use 
the performance estimates for each stock obtained from the 
neural nets to construct portfolios tfiat in^)rovc the returns 
on capital investments. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing features of the present invention may be 
more fully understood from the following detailed disclo- 
sure of a specific iH'efeired embodiment in conjunction with 
the accompanying drawings, in which: 

FIG. 1 is a block diagram of the financial management 
system far creating and administering an investment port- 
folio in accordance with the preferred embodiment of the 
p-esent invention. 

FTG. 2 illustrates, in a block diagram form, the method of 
the present invention for creating and administering an 
investment portfolio optimized over a predetermined period 
of time. 

FIG. 3 illustrates, in a block diagram form, the steps 
required to provide the inputs used for training and operating 
the neural net for a selected stock. 

FIG. 4 shows, in a block diagram form, the steps for 
creating and using a neural net for estimating the perfor- 
nunce potential of an individual stock. 

FIG. 5 is a schematic diagram of a neural net configura- 
tion employed in the preferred embodiment of the present 
invention. 

FIG. 6 illustrates, in a block diagram form, the method of 
extracting the training, test and validation sets from the 
entire data set in a prefexrcd embodiment of the present 
invention. 

FIG. 7 depicts an cxair^le of a display of the actual and 
predicted 12 week ALPHAs from the validation set for a 
specific stock using one embodiment of the present inven- 
tion. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 
FIG. 1 illustrates in a block diagram form the financial 
management system 9 of the present invention which pro- 
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vides investors 7 with a way to invest in capital markets 1. 
Capital markets 1 may represent one or more domestic 
and/or foreign stock n^^kcts and/or indices associated with 
the performance of sudi markets. In a preferred embodiment 
5 of the present invention, financial management system 9 
comprises: a pr^ocessing system 305 receiving and select- 
ing input for the stocks in the capital market and general 
market data; a set of neural networks 300 for estimating the 
peifornmnce potential of the stocks in the market; data 
jQ processing system 310 designed to create and administer an 
optimized investment portfolio; storage 320 for storing price 
and other infonnatiOD about the stocks in die capital market, 
as well as investor preferences; storage 330 for storing 
miscellaneous performance characteristics associated with 
the stocks; and interactive communication means 340 that 
provides a two-way conununication between the investors 7 
and the financial management system 9. In one embodiment, 
financial system 9 is implemented on Pentium PCs. Hewlett- 
Packard. Sun Microsystems workstations, as known to pcr- 
2Q sons skilled in the art 

Other computer systems known in the art can be 
employed as a hardware platform for the system of this 
invention. Such computer systems, as known in the ait 
comprise a central processing unit (CPU) memory (RAM. 
2j ROM, etc.), input and output devices and user interfaces. 
FIG. 2 illustrates, in a block diagram form, the method of 
the preferred embodiment of the present invention for cre- 
ating and administering an investment portfolio optimized 
over a predetermined period of dme. Specifically, blocks 10, 
JO 20 and 30 illustrate primary sources of information about the 
stocks in the capital market (for example stocks listed in the 
S&P 500 (Large C^) and the S&P 400 (Mid Cap) indices 
in die U.S.) and about broad market indices associated with 
the market Block 10 represents selected technical input 
35 information for each stock, such as the price and trading 
volume of each stock. Block 20 represents returns associated 
with the indices in the capital marlcet (c.g.. the S&P 500 and 
S&P 400 indices), and block 30 represents fundamental 
information such as earnings related data. In a preferred 
40 embodiment of the present invention, the output of block 30 
includes the "Surprise Forecaster Ranking** from Chicago 
Investment Analytics. Inc.. 208 South LaSalle St. Suite 608, 
Chicago, ni. 60604. This data indicates the probability of a 
positive earnings surprise, i.e. the probability that a com- 
45 pany will report better earnings than the analysts' estimates. 
Other data from the same source which may be used in block 
30 includes the "Momentum Alert,** the "Growth Alert,** the 
*'Value Alert**, etc. In other embodiments, a person skilled in 
the art on the basis of this disclosure will be able to use other 
50 fundamental data from other sources. 

The combined technical and fundamental information 
from blocks 10. 20 and 30 is then passed to pre-processing 
block 40, which indicates the functions perfoimed by the 
system 305 in FIG, 1. fte-proccssing block 40 generates a 
55 number of data sequences, each sequence representing the 
values of certain parameters of the stock computed at 
predetermined time intervals in the past iH*eferably one 
week apart These data sequences are specific for each stock 
and are subsequently used to train the neural net for each 
60 stock. Each data sequence has been historically found to 
relate to the performance of the stocks and thus may be used 
as a predictor of their fuoire paformance. Appendix A 
provides a listing of the data sequences of the preferred 
embodiment of the present invention, along with their 
65 definitions, which are preferably generated at block 40. This 
data includes the parameter BETA (p) which Is a measure of 
volatility of the stock relative to an underlying benchmark; 
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ALPHA (cx) which is a measure of the excess return of the portfolio on the basis of the appreciatioii potential estimates, 

stock over (lie beta-adjusted beachmark; the standard dcvia- In this block each stock is allocated a weight proportional to 

tion of the parameter ALPHA conq>uted over a predeter- its estimated ^preciation potential within the ^^'edetermiaed 

mined period of time; the slope of the earnings surprise future time period, such that the risk constraints and investor 

rankings and other parameters, as defined in Appendix A and 5 preferences are satisfied. Such weighing generates model 

explained in more detail below. On the basis of the present i^ng and short portfoUos. These portfoUos comprise a 

disclosure, a person skaied in the art will be able to use other selected number of stocks which are estimated to have 

parameters as well respectively the best and the worst appreciation potential 

Functional block 50 in HG. 2 denotes the steps required j^c predetermined period of time. An investor utilizing 

to build and operate a neural network 300 for eadi slock ^ ^ ^ invention may then make invest- 

with the input factors above appropriately scdcd The first ^^^^ decisions on the basis of the suggested model portfo- 

step in me process is to se ect an arddtecture for die neura^ ^.^^ ^ mo^m description only the "long" portfoUo, 

network. In accordance with one embodiment of the present ^t^tc «h-T«at^H tn hav/rt^i- h^ct T^Jfr^»r,n^ 

invention, the architecture of the neural network is a mul- compnsing stocks estimated 

tilayer feedforward neural net with backpropagation imple- prespecified penod of time^ wm be considered 

mented using the generalized delta ruk! as dcscribcdfor because the selection process for toe short" portfolio, 

example in Rumelhart et ai, "ParaUcl Distributed Process- compnsmg stocks expected to have the wont performance, 

ing: E?^5lOTations in the Microstnicture of Cognition." Vol. >s essentially idenUcal (especially if modeled using negative 

1, Ch. 8. MTT Press, Cambridge. Mass., whidi is inccrpo- weights during tiic portfoUo conslniction process), 

rated herein by reference. The neural nets defined in this way Finally, in block 70 ttie system of the present invention 

can be simulated using commercially available software 20 continuously monitors the performance of the selected 

packages such as the BrainMaker Professional marketed by stocks and compares the return of the model portfolio to the 

California Scientific Software. NeuralWorks Professional overall investment return in the capital market. Should the 

n/Plus from NeuralWare Inc., Neuroshell 2, distributed by output of the neural nets for die specific time period indicate 

the Ward Systems Group, Frederick, Md. 21702, or others. j^e appreciation potential of certain stocks has changed 

For example, a custom batdi version of the Neuroshell 2 25 significantiy, data processing system 310 in HG. 1 may 

I^ogram used in a preferred embodiment of Uie present rebaUnce the composition of the suggested optimized model 

invention is capable of simulating over 10 000 nets which ^^^^ ^^^^^^^ ^ ^^^^ 

can be trained m a batch mode r^tcdly with the net ^^^^^ ^ accordingly, 

configuration for each stock (architecture, stoppmg catena, „ ^ . . ^. ^ 

etc.) specified in a system file. NeuroSheU 2 User^ Manual ^ ^9'^ illustrates m a block dia^am form, the steps 

available from Ward Systems Group (and particularly its required to generate tiie data for use as input to the neural net 

chapter 5) is hereby incorporated by reference. Appendix B a ff^en stocfc Assunjmg that ^hesy suma ^ ^es^^ed t^ 

is a listing of die batch configuration file used in the system ^ performance of tiic U.S. S&P 500 and S&P 400 

of d,e present invention for the purpose of training and '^^<^}^ preproassing system 305 first calcuUtes 

testing Se neural nets. A pason skilled in tiie art, on the 35 » ^/^^ " ^^^^^^ 

basis of the present disclosure, will also be able to use otiiCT ^^^^ returns. 

neural net software including developing such software. A sequence of historical stock data is used to generate the 

Neural nets 300 in FIG. 1 which correspond to the stocks in inputs used to train the network f<x a given stock. The 

the capital maikct may also be in^lemented in hardware. in^)ortance of newly available data in relation to the less 

The architecture of the neural net may also be varied (by 40 recent data is reflected by die use of the weight factors. Thus, 

means of changing die input parameters, the number of » weight factor w^ associated with the inputs during the i-th 

hidden layers, by modifying the activation functions, or by time period is calculated in step 105. The value of die weight 

other means) if in the course of operation it becomes factor w for the i-di time period is given by Uie formula: 
necessary to more accurately predict the performance poten- 

tial of the underlying stocks. Thus, different architectures of 45 *^ 

die nets can be selected for different securities so as to where Z is the sum of all time periods from 1 to N, N being 

improve performance. die number of time periods used in the "lookback'* process. 

The neural nets for alt individual stocks are then trained In a preferred embodiment of die present invention the unit 

using the data sequence generated in block 40. The training time period is one week, and the number of weeks N is set 

process continues until the connections between die nodes of 50 between 40-120, preferably S4 weeks. As indicated, die 

the nets reach a desired state, as determined by an appro- application to the input data of a ^Yorgetting factor^ w, is 

priate stopping criterion, as well known in the art and based on the observation diat more recent information about 

discussed in more detail below. Once the neural nets for die a stock is usually more relevant to the prediction of the 

stocks of a capital market have been trained in accordance future performance of the stock. 

with the present invention, in block 60 the neural nets are 5s In the following steps 107 and 109, preprocessing system 

used to make a prediction of the appredaUon potential of 305 in^>orts values of the earnings surpise rankings and 

eadi stock within a predetermined period of time. These computes its slope. Preferably, the slope computation is 

predictions are made by running new data through the done over a 6-week period. As indicated above, sudi data is 

trained neural nets. The output of each neural net 300 is die available from Chicago Investment Analytics. Inc., or 

parameter ALPHA which is an estimate of the appreciation 60 another ^jpropciate source. In st^s 110 and 120, prepro- 

potcntial of the corresponding stock over die predetermined ccssing system 305 computes die BETA parameter and its 

period of time. As described below, some factors generated slope, respectively, while in step 125 the diversiftable van- 

in block 40 are also used in the portfolio construction ancc is computed. The ALPHA parameter, its standard 

process to ensure that the portfolio satisfies investor prefer- deviation, and calculations based on the cumulative ALPHA 

cnces and certain aggregate risk characteristics. 65 stochastic parameter (such as its slope and inflection) for the 

Block 60 in FIG. 2 indicates die operation of the system stocks are computed in stq>s 130, 135, 138, 140, 142, 145 

310 in FIG. 1, which is used to create an optimized stock and 155 as defined in Appendix A. The sequence of Steps for 



01/12/2004, EAST Version: 1.4.1 



5,761 

9 

the coniputatioD of the hybrid index return, the earnings 
surprise data, the BETA and ALPHA parameters and other 
factors derived from such information is shown in FIG. 3 for 
illustrative purposes only and may also be computed in a 
difiFerent order. 5 

The following steps 157-205 illustrate the sequence used 
in one embodiment of the present invention to compute 
values for ail remaining input parameters, as defined in 
Appendix A. Thus, in steps 157 the power factor is 
computed, in steps 160 and 165 the negative and the positive lo 
power coefficients arc computed, respectively; in step 170 
the power slope is computed; in step 175, the LPM factor is 
computed; in step 180 the historical ALPHA parameter for 
the stock is computed; in step 185 the system computes the 
residual, in step 190 the long-term Price/Earnings (P/E) ratio 15 
t>a5ed on scaled earnings is confuted. In steps 194 the 
relative P/E parameter is computed, in step 196 the Average 
Volume factor for the stock is obtained. Id step 198, pre- 
processing system 305 computes an estimate for die future 
stock return and on the basis of this estimate computes in 20 
step 200 values for the ALPHA parameter of the stock for a 
period M weeks into the future (preferably, the conqjutation 
is done for 4. 12 and 24 weeks). The last step 205 in FIG. 
3 illustrates the sq^aration of the available input data into a 
training, testin g. validation and production (referred to as : R, is 
S, V, P. respectively) sets as explained in more detail below 
and illustrated in FIG. 6, 

In accordance with a prcfeircd embodiment of the present 
invention, the ALPHA parameter estimate is a measure of 
the appreciation potential, and thus the attractiveness of the 30 
stock, and is used by the system to determine which stocks 
to be included in the model stock portfolio. The number M 
of weeks which the system looks ahead (the time horizon of 
the system) is in the range of about 4-30 weeks and is 
preferably set equal to 12 weeks. The confutation of input 35 
quantities which are related to the change in the value of a 
given parameter, such as its slope or inflection, is typically 
done using the differences between the values of the param- 
eter over a unit period of time. 

The method in FIG. 3 Is merely illustrative of the method 40 
of the present invention, as applied to die problem of 
predicting the appreciation potential of the stocks. As indi- 
cated above, other indicators may be used dependent on the 
time horizon of the system, the required degree of accuracy 
and other factors. Input parameters are diosea from a broad 45 
set of inputs which are derived from fundamental data about 
the stock, its price, volume, and index price data. Whether 
CH- not a given input should be included as a predictor of the 
ALPHA parameter is determined using sensitivity analysis, 
as known in the art Thus, if a particular input improves the 50 
prediction accuracy of the net over a period of time, it is 
included in the set of inputs. If. on the other hand, a specific 
input docs not measurably correlate with the ou^ut ALPHA 
parameter, as indicated by sensitivity analysis of the predic- 
tion results over a period of time, tfiis input may be dropped. 55 
Thus, the inputs presented in Appendix A, while illustrative 
of the general operation of the system of the preferred 
embodiment, may from time to time be supplemented by 
other inputs which show a better prediction accuracy for the 
particular stock. The process of selecting the inputs for the 60 
prediction of the appreciation potential of each stock in the 
market is an ongoing process which aims at finding a set of 
inputs which can provide good prediction accuracy. On the 
basis of the present disclosure, a person skilled in the art will 
be able to select such inputs. 65 

FIG. 4 shows the steps for creating and using neural nets 
to provide an estimate of the appreciation potential of 
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individual stocks on the basis of data generated in the 
prqwoccssing system 305. In step 200. the system operator 
selects the input and output parameters of the neural net 
training process. The selection of input parameters is 
described above and illustrated in FIG. 3. In a preferred 
embodiment of the present invention the output of each 
neural net is an estimate for the value of the ALPHA 
parameter of a given stock M weeks ahead. 

The following step 210 of the method involves normal- 
ization and scaling of the values provided to the system 
widiin pre-specified boundaries. Several scaling opdons are 
available. In a preferred embodiment of the present 
invention, the option is to use linear scaling of the input 
parameters between -1 and +1 with allowance for over- 
shoots in the new data. Other available scaling options 
include: 

a) linear scaling between -1 and +1, no overshoots 
allowed; 

b) linear scaling between 0 and 1, no overshoots allowed; 
and 

c) linear scaling between 0 and 1 with allowance for 
overshoots. 

Scaling of the input parameters of a neural net is well 
known in the art 

In step 220 the architecture of the network is selected. As 
illustrated in FIG. 5, in accordance with a preferred embodi- 
ment of the present invention, each neural netwOTk consists 
of three layers of nodes with links between the nodes in each 
layer and juii^ connections between the input and output 
layers. Jnpui layer 12 has preferably 15-20 input nodes, 
receiving inputs from preprocessing system 305. Input layer 
12 is connected directly to the output layer 18 which only 
has one node. The hidden layer of nodes is split into two 
groups: a) a group of nodes 14 preferably using a Gaussian 
activation function (exp(— x^)); and b) a second group of 
nodes 16 preferably using a Gaussian complement aalvation 
function (l-exp(-x^)). Other activation functions, such as 
the logistic (l/(l+exp(-x)). the symmetric logistic (2/( 1+exp 
(-x)>-l), the sine (sin(x)), or the tanh (tanh(x)). where tanh 
is the hyperbolic tangent, may also be used in other embodi- 
ments for the nodes of the neural net The hidden layer 
groups 14 and 16 have preferably 5-13 nodes each and 
receive input from the input layer 12. The output node layer 
18 of the system preferably uses a logistic activation func- 
tion. As indicated above, the network itself is Implemented 
using a standard software package, ninning on a computer, 
or in other embodiments using customized software and/or 
hardware. 

Following the selection of the network architecture, in 
accordance with the method of the present invention steps 
230, 240 and 250 in FIG. 4 implement a neural net training 
process. In a specific embodiment a total of 3,000 neural 
nets are trained, one for each stock in the S&P 500 and S&P 
400 indices and 2,100 other stocks which are selected using 
capitalization and liquidity screening as known in the art 
More specifically, in step 230 of the preferred embodiment 
the data set is partitioned according to the illustration in FIG. 
6. In step 240 stopping criteria f<^ the training process are 
selected, which may be one or more of the following: a) the 
average error is below a certain threshold; b) the number of 
passes through the training set since reaching a minimum 
average enror exceeds a pre- specified number; c) the largest 
prediction error is below a certain level; or d) the number of 
passes through the traioiog set has reached a preset limit 

In step 250 each network is trained using data from die 
inputs provided by the preprocessing system 305 and as 
extracted by block 420 in FIG. 6. During training, die 
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network is tested every P eveots (an event involves data 
pertaining to one time period), where preferably Ps=20, to see 
whether a better net has been found, in which case the 
parameters of the -new net arc saved For each stock, and 
correspondingly for each training process, a stopping crite- 
rioD is defined such that after the system reaches a certain 
degree of confidence, or a oeitain aror rate in the recogni- 
tion of input patterns, the training process is tcmunatcd. The 
training process can be stopped when any of the above 
stopping criteria is met. 

Considering step 250 in further detail, in the preferred 
embodiment of the system of the present invention the 
training process uses all available data about a particular 
stocic Thus, eadi week a set of new values f<x each input to 
the neural nets is stored and used in the training process. As 
illustrated in FIG. 6, the entire data sequence is divided into 
a training set. a test set and a validation set. Speciftcally. in 
step 400. the data for the M most recent weeks, where M is 
the time horizon of the system, is extracted to form a 
separate *'production set,*' The training, test and validation 
sets of data arc chosen randomly, subject to the constraint 
that the extreme values of the ALPHA parameter are rej^e- 
sented in all sets. Excluding the production set. preferably 
about 65^ of the data is used for training (R %), about 15% 
is used for the test set (S %) and about 20% is used for the 
validation set (V %). Thus, in a case of about 400 weeks of 
available data, where the prediction is made 12 weeks ahead, 
about 252 weeks are used for training. 58 weeks are used for 
testing and 78 weeks are used for validation. A person 
skilled in the art. on the basis of this disclosure, can select 
different values where appropriate. 

The validation set is first extracted in step 410, based on 
the tails of the output parameter distribution. Tails arc 
defined to be more than about D (initially 0=1.5) standard 
deviations away from the mean, as long as at least 60% of 
the tails remain for inclusion in the training and test sets. If 
such condition is not satisfied by the available data 
sequence, the value for D is lowered in steps of 0.2, until the 
condition Is met. Following that selection, the training and 
test sets are extracted in stq) 420 in a random fashion from 
the remaining data, subject to their relative percentages 
constraint. 

In accordance with the preferred einbodiment of the 
present invention the net is trained using the training set The 
test set is used to decide when to stop the training process, 
and the validation set is used to decide how good the net's 
predictions are. Spedflcally, the network is tested several 
times during a complete pass through the training set (one 
complete pass through die training set is defined as an 
cpodi). Fot example, the network can be tested every 20 
weeks of data and the network weights are saved if the 
perfarraaace of the current neural net on the test set is better 
than the performance of the previously saved neural net. The 
training process stops after about 250 epochs, or if the 
network goes through 20,000 events of weekly data (which 
includes several passes through the same data) without 
in^rovement, or if the average root-mean squared (RMS) 
error is less than a given threshold (0.001 is a typical value 
which can be used for the logistic activation function at the 
output node). 

Finally, the validation set is used to determine how good 
the ALPHA parameter estimates arc for the particular stock. 
Dependent on the recorded etrors in the validation process, 
the system assigns a measure of goodness (or reliability) for 
the parameter ALPHA estimate of a particular stock, and 
uses this measure later in the process of optimized portfolio 
gcnaation. 



12 

During the training process the weights of the network arc 
adjusted in increments or decrements which ctHxespond to 
the prcspcdiicd learning rate of the net (higher learning rates 
may result in faster "convergence," but also often produce 
5 larger prediction errors). The system of the preferred 
embodiment of the present invention additionally uses a 
momentum factor, which takes into account tiie previous 
weight change in determining the stq) size of the fffcscnt 
weight change. Such a momentum factor is useful in reduc- 
ing the effects of noisy data and makes the training process 
dependent on how close the net is to its final state. In the 
system of the present invention, typical values for the 
learning rate and the momentum are 0.1 and 0.1. respec- 
tively. 

J 3 One of the problems in stock or market return modeling 
is the propensity to "overfit** the model to the training data, 
which usually results in poor performance oo the test set and 
production set. In the present invention, the use of both test 
and validation sets alleviates this problem because the 

2Q performance of the net can be monitored periodically. More 
specifically, if the net is overtrained with data from the 
training set tiie prediction accuracy of the net when applied 
to the test set data is likely to drop. In this case the system 
may discard the overtrained net and use a previously saved 

23 net configuration. In any case, the validation set is used to 
insure that the performance of the net on data, which is not 
used in the training process or in determining when to stop 
the training, does not deteriorate. 

Tuning back to FIG. 4. in step 2<)0 the trained network is 
finally used to predict the appreciation potential of the 
particular stock M weeks ahead by running the current input 
factors derived from new market data (the production set) 
through the neural net. As discussed above, in a preferred 
embodiment of the present invention, the appreciation 

33 potential of the stock is estimated 12 weeks ahead and is 
expressed in terms of predictions of its ALPHA parameter. 

In a spediic example, assuming the ALPHA parameter 
predictions arc being made 12 weeks forward, the produc- 
tion set which is hot employed in the training process 

4Q consists of the latest 12 weeks of data. Using the trained net, 
the system first generates an ALPHA parameter prediction 
Pi for the following week (time T+1) using data from 11 
weeks ago (T-1 1). Next, an ALPHA parameter prediction Pj 
is generated for two weeks into the ^ture (T+2}, using data 

45 from ten weeks ago (T-10); and so on until a total of 12 
predictions P^ (T+i2 being the last time horizon) for the 
ALPHA parameter are generated. Each of the predictions P, 
is associated with a partial actual ALPHA value A,, which 
corresponds to the actual ALPHA parameter realized until 

3Q the current time for the security. Thus, A^ is the partial actual 
ALPHA parameter from 11 weeks ago through the current 
week, and A13 is equal to sero. On the basis of the predic- 
tions P, and actual partial values A, an average unrealized 
ALPHA parameter value for the security is calculated, using 

33 the following formula: 

I 12 

Ayct. uzncaluod ot - "rx- 1 (/** - ^i) 
1* i=l 

60 The average unrealized ALPHA is used as a summary 
statistic for the appreciation potential of each stock, and is 
used in the portfolio optimization process. 

HG. 7 depicts a display of the actual and predicted 
12-week ALPHAS from the validation set for a specific stock 
65 using the trained neural net for that stock. 

In accordance with another aspect of the present 
invention, a model portfolio of stocks may be generated on 
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the basis of the stock apprcciatioD potentials estimated by 
the neural nets 300 and a set of factors that control risk, such 
as the BETA and the Divcrsifiablc variance, as well as 
investor preferences. 

RefeiTiog back to FIG. 1, the selection of an optimized 3 
portfolio of stocks is done in accordance with tiie preferred 
embodiment of the present invention using a program imple- 
mented in the data processing system 310. Data processing 
system 310 receives and stores in storage 330 data about the 
^preciation potential of ea<A stock in the capital market and jo 
then conelates this input data with risk factors and investor 
defined constraints stored in storage 320 to generate an 
qptimized portfolio of stocks. Data processing system 310 
further models the performanoc of the overall market, and 
IH-edicts the future correlation between the return of the is 
selected portfolio and at least one index of the c^tal market 
to ensure that the portfolio will provide a desired investment 
return ova* the predetermined period of time. 

Typically, the selected optimized portfolio contains much 
fewer stocks than the underlying p^ormance benchmark 20 
portfolio (such as the S&F 500 Index or the S&P 400 
MidCap Index). This is because the pcmfolio creation pro- 
cess concentrates on stocks that have a large ^predation 
potential along with favorable liquidity and risk character- 
istics. 23 

The portfolio optimization of the preferred embodiment is 
implemented using commercially available software mod- 
ules that aid portfolio selection and risk and investment 
analysis. Tliese include coimnercial optimizers such as die 
''Solver*' which is part of the Eicel spreadsheet program 30 
from Microsoft Corporation, the "Evolver", a genetic algo- 
rithm based program from AxccUs Inc, 4668 Eastern 
Avenue North, Seattle, Wash. 98103, or software packages 
which are available from the Harvard Business School in 
Cambridge, Mass.. or vendors such as Baira International in 33 
San Francisco, Real Decisions Corporation in Darien, Con- 
necticut and others. Such commercial portfolio opCimizers 
can be integrated as part of data processing system 310. 
Otho* commercial products or custom software can be used 
and developed by persons skilled in ttie art on the basis of 40 
the disclosure. 

In operation, data ^s-ocessing system 330 continuously 
monitors the peiformance of the optimized poftfolio and 
compares its overall investment return to the broad market 
bcndunarks of the associated capital market. In addition. 45 
neural nets 300 con^)ute the ^preciation potential param- 
eter ALPHA for each stock in die market. This information 
is processed by the portfolio optimizer in system 310 which 
determines the composition of the optimized portfolio. In 
the U.S. capital markets this portfolio includes about 50-70 50 
stocks; for example, the MidCap portfolio has about 60 
stocks out of a total of 400 stocks. If the system detects a 
degradation in the cumulative ALPHA parameter of the 
portfolio, or if the neural nets indicate high appreciation 
potential in stocks which are not included in the optimized 55 
portfolio or low potential in stocks which are part of the 
c^timal portfolio, the composition of the portfolio can t>e 
changed. 

In accordance with a preferred embodiment of the present 
invention, rebalancing of the optimized portfolio can be 60 
effected at regular time intervals, such as one week, or be 
triggered by large variations in the output of the neural nets. 
In particular, data processing system 310 receives input 
about the predicted ALPHA parameter of each stock in the 
market from the neural nets. This information is analyzed 65 
along with other factors, such as the BETA parameters of the 
stocks, market sector constraints, transaction costs and other 
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data stored in storage 320 and 330 to check whether the 
existing stock portfolio's expected return can be inoproved. 
As a result of the analysis, the portfolio optimizer may 
suggest rebalancing the previously optimized portfolio by 
one or many of the following: 1) changing the existing stock 
positions in the portfolio, e.g. suggesting a 3% position in a 
particular stock from an earlier 2.1% position; or 2) adding 
a new stock to the portfolio with an appropriate weight; or 
3) deleting an existing stock from the portfolio. The sug- 
gested changes can then be imj^emented by a human 
operator. 

The above described preferred eixibodiment of the system 
of the present invention is merely illustrative of the prin- 
c^les of this invention. Numerous modifications and adap- 
tations thereof will be readily apparent to tfiose skilled in the 
art without departing from the spirit and scope of the present 
invention which is defined in the following claims. 

APPENDIX A 
Inputs and Ou^uts to the Neural Net 

The following are the inputs to the neural net Those with 
(♦*) are not used by the neural net 
** TICKER 

This is the ticker of the stock. (e.g. ACN) 
** DATE 

This is the Date of the data as of Friday. Dates start in 
January, 1985. (e.g. Apr. 29, 1988) 
OA 

This is the surprise ranking factor (measuring the ^db~ 
ability of an earnings surprise in the forthcoming quarta) as 
downloaded from Chicago Investment Analytics. Values 
range from 1 to 100 (a 1 implies the highest probability of 
positive earnings surprise in CIA*s universe of stocks) and 
values that wae undefloed for certain weeks are assigned a 
50. (e.g. 13). 
CLV_SLOFE 

This is the regression slc^ of the surprise ranking factor 
values for the last N (currently 6) observations, (e.g. 
0.2041). 
BETA 

Beta (measure of volatility) for the ticker (stock) returns. 
(c.g. 1J263) 
Beta=CoVar StisA^ar Index 

CoVanKSum of all (RIndex-AVG*RSTK-AVG)/Nuraber 
of periods) 

Var Index=(Sum of (RIndex-AVG*RIndcx-AVG)/ 

Number of periods) 
RIndex-AVCj=Cun*ent Return on Index-Average Return 

on Index 

RSTK-AVG=Current Return on STK-Avcrage Return on 
STK 

Current Return on Index and STK are front-end loaded 
with the period weighing factor for calculating Beta and 
other factors below. 
BCTA_SLOPE 

This is the regression slope of beta for the last N 
(currently 4) observations (weeks), (e.g. 0. 1334) 
DIV_VAR 

DiversiAable Variance (e.g. 0.2469) 

Div Var=Var OTC-(Beta*BcU'*Var Index) 

Var STKKSum of Squares of RSTK-AVG)/Numbcr of 
periods 

Var Index=(Sum of Squares of RSP-AVG)/Numbcr of 
periods 
SDEV^ALPHA 

Standard Deviation of A^ha value. Alpha is the difference 
between the stock's actual return and the expected return 
(based on its Beta), (e.g. 11.9753) 
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A^rOCH 

Cumulative Alpha Stochastic (e.g. 83.3330) 

Average of latest 6 weeks of K A. 

K A=<(CuiTeDt week cumulative Alpha-Minimum cumu- 
lative Alpha over 6 weeks)/(Maximum cumulative 
Alpha over 6 periods Minimum cumulative Alpha over 
6 weeks))* 100 
A_SLOPE 

Slope of the cumulative Alpha Stochastic for the current 
pericxl (T) which is the regression slope of the cumulative 
Alpha Stochastic value for the last N (currently 4) observa- 
tions starting at the current period (week). (e.g. 3.9160) 
A_SL0PE_T1 

Slope of the cumulative Alpha Stochastic for the previous 
period (T-1) which is the regression slope of the cumulative 
Alpha Stochastic value for the last N (currently 4) observa- 
tions starting at the previous period (previous week), (e.g. 
0.9560). 
A„SL_SL 

Slope of the Slope (second derivative) of the cumulative 
Alpha Stochastic, (e.g. 2.9600). The formula is simplified to 
the difference between the slopes of the cumulative Mpttz 
Stodiastics at the current and the previous periods. 

AL.SL_SL=A_Slopc-A^Slopc_Tl 
INFL 

Inflection indicator based on the value of the cumulative 
Alpha Stochastic (e.g. 0) 
If Alpha_Slope>=0 and Alpha_Slope_Tl<0 then Infl=l 
Else 

If Alpha_Slope<=0 and Alpha_Slope_Tl>0 then 
Infl=-1 Else Infl=K) 
**POWER FACTOR 

Return on Stock:=((Cun- Stock Qose-Stock Qosc previ- 
ous wcck)/Stock Qosc previous week)* 100*Pcriod Weight- 
ing 

Return on Index=((Curr Index Qose-Index Qose previ- 
ous wcck)/Index Qose previous week)* 100* Period 
Weighting 

Power Factor^Retum on Stock-Retiim on Indcx)*Period 

Weighting 
POS.PWR_FC 

Positive Power Factor, (e.g. 0.2313) 

Positive Power Factor=Cumulative Power FactorA'aluc 

AA^alue B 

Value A=Numba of positive values in return on Index 
Value B Standard Deviation of Power considering only 

those time periods with positive return on the Index 
NEG_PWRJC 

Negative Power Factor, (e.g. 0.7823) 

Negative Powa Factoi=Cumulativc Power Factor/Valuc 

CYValueD 

Value C=Number of negative values in return on Index 
Value I>=Standard Deviation of Power considering only 
those time periods with negative return oo the Index 
PWR_SLOPE 

8-week regression slope of die Negative Power Factor 
values. (e.g. 0.0422). 
LFM_FC„MN 
Lx)wer Partial Moment (e.g. 13,4200) 
LPM_FC_>IN=Avaagc of LPM Factor over each week 

in die lookback period. 
LPM Facton=(Maximum of 0 or (Beta Adjusted Return oo 
Index-Return on OTK)) to the power of LPM Degree. 
LPM Degree is a parameter typically in the range of 1.5 
to 2.5 (cuirenlly 2). 
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ALPHA_PACT 

Alpha Past=Avcragc of Alf^a over the lookback period. 
(e.g. 1.9509) 
RESIDUAL 

5 Residual=l-r squared value of linear regression of STK 
Return and Index Return, (e.g. 0.5340) 
LTPE 

Long Term P/E Ratio=Cuirent Closing Price/Scaled Earn- 
ings. (e.g. 0.5747) 
10 Scaled Earnings is the 12 -month trailing earnings trans- 
formed to a value from 1 to 100. 
RELPE 

Relative Price Earnings is the P/E relative to the index. 
(e.g. 2.2010) 
13 AVGVOL 

Average Volume Factor (in percent), (e.g, 101.8800) 
AVGVOL=100*(Current week*s volume/Average weekly 
volume over the last 10 weeks) 
Note: Only one of the following ou^ts is considered at a 
20 time by the ncin-al net 
STK_BEri2 

Stock Return 12 weeks into the future, (e.g. 0.01) 
STK_RET12=(Stk Close 12 weeks ahead-Curr Stk 
aose)/Cuir Stk Close. 
" ALPHA_4_JUTURE 

Alpha 4 weeks into the future. e.g. 0.09 
Alpha_4_Future=STK_REr4-((Indcx_RET4-Cash_ 
REr4)*Beta+Cash_RHr4) 

STK^_REr4=(Stic Close 4 weeks into the fumne-Curr Stk 

CloscVCurr S± Close, 
Indcx_REr4^Index Qose 4 weeks into the future-Curr 

Index Close)/CurT Index aosc. 
Cash_REr4=(Cash Qose 4 weeks into the futurc-Cuxr 
35 Cash OoseyOm Cash Qose. 
ALPHA_12_JTJTURE 
Alpha 12 weeks into the future, (e.g. -0.15) 
Alpha. 1 2_Futiire=STK_REr 1 2-((Index_RET 1 2- 
Cash_REri2)*Beta+Cash_REri2). 
40 STIL_REri2=(Stk Qose 12 weeks into the future-Cuir 
Stk aose)/Curr Stic Qose. 
Index_REri2=(Index Qose 12 weeks into the future- 

CiuT Index aose)/Curr Index Close. 
Cash.REri2=(Cash Close 12 weeks into the future- 
Cuir Cash CloscyCurr Cash Qose. 
ALPHA__24_FirrURE 
Alpha 24 weeks into the future, (e.g. 0.21) 
Alpha_24_Future=STK^Er24-((Index__KET24- 

Cash_REr24)*Beta+Cash_.RFr24). 
STK_JlEr24=(Stk Qose 24 weeks into the future-Curr 

Stk aose)/Curr Stk aose. 
Indcx.JiEr24=(Index Close 24 weeks into the futurc- 
Cun Index Close)/Cuir Index Qose, 
55 Cash_jyET24=(Cash Qose 24 weeks into the ftjture- 
Curr Cash aose)/Chirr Cash Qose, 
Note: This variable is used to designate which **rows" go 
Into the training set (R)« the test set (S) and the 
validation set (V). 

60 TT 

Code for the Neural net which can have the values R, S, 
V. (e.g. R) 

The values are distributed based on a percentage (of the 
entire dataset) to each of the 3 values. e,g. 65%, -15%, 20%. 
65 Note: Factors (such as Beta) that require a lookback period 
for the calculation typically use a value in the range of 50 to 
120 weeks (currently 84). 
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APPENDIX B 

Coniiguration fUe f<H' Batch Stock Neural Net TVaiziing/ 
Testing 

"Input file type:"*. 0 /* Conoraa separated asdi file */ 
^'Ou^ut file type:**, 0 5 
"Header row: *M /* Number of header rows */ 
**Input dir:**, "cioput directory namo" 
**0u1put dir:", "<output directory name>" 
**Maximal number of columns :'*J26 

**Maximal number of rows". 500 lo 
^'Number of columns: **,26 /* Actual total number of columns 
in file */ 

"Number of rows", 397 1* Actual number of rows in first file 
*/ 

"Pattern extraction method:**,0.2^,l /* Pattern extraction 15 

using TT column */ 
**Number of all parameters" J26 /* Numl)er of parameters 

(Ustcd below) */ 
**Parameter# 1:";TICKER" 
"Parameter # 2:", "DATE" 
•Parameter #3:", "CIA" 
•Parameter # 4:**."CIA_SL0PE" 
•Parameter # 5:"/*BETA" 
'Parameter # 6:"/*BErA_SLOre** 
"Parameter # 7:";*DIV_VAR" 
'Parameter # 8:";*SDEV_ALPHA" 
'Parameter # 9:","A_SL0PE" 
'Parameter # 10:"."A__SLOPE_T* 
^Parameter* 11:*',"A_SL_SL" 

'Parameter # 12:","INFL" 30 
'ParamctCT # 13:";*ASrOCH" 
'Parameter # 14:","P0S_PWR_FCr 
•Parameter # 15:","NEG_PWR JC* 
'Parameter # 16:","PWR_3L0PE** 

'Parameter # 17:'*,"LPM_FC_MN** 35 
'Parameter # 18:'*."ALPHA_PASr' 
'Parameter # 19:"."RESIDUAL" 
'Parameter # 20:" "LTPE" 
•Parameter # 21:"/'RELPE" 

"Parameter # 22:" ." AVGVOL" 40 

•Parameter # 23:","STK_REri2" 

•Parameter # 24:".*'ALPHA_4_FTJTURE" 

•Parameter # 25:","ALPHA_12_PinrURE" 

•Parameter # 26:","ALPHA_24_FUTURE" 

"Number of input parameters:", 16 /* Actual number of 45 

inputs (listed below) */ 
'Input parametcr:"/'CIA**,l 
"Input parameter:","CIA_SLOPE"a 
"Input parameter: ","BETA",1 

"Input parameter".'*BETA_SLOPE:,l 50 
"Input parameter ","SDEV_ALPHA"J 
•Input parameter'*,"A_SLOPE_Tra 
"Ii^nit parameter; VlNFL^.l 
"Input paramcter:*'/'ASTOCH",l 

'Input parameter**,'*NEG_PWR_FCra 55 
'Input paramcten"/PWR_SLOPE".l 
"Ii^t parameter:"/'LPM_FC.3IN".l 
'Input parameter:"/'ALPHA_PASr*a 
'Input parameter:","RESIDUAL"a 

"Input paramcter:**.*lTPE**,l eo 
'Input parameter :**.'TtELPE*'J 
"Input parameter: VAVGVOL"a 

"Number of output parameters:", 1 /* Number of outputs ♦/ 
**Ou^ut pararaetcr:**,"ALPHA_12_FLrnjRE'*a /* Ou^ 
and scaling type */ ^ 
/* ALPHA_4_FUTURE is the other commonly used 
output •/ 
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"Net type:**^ /* Net architecture type ♦/ 
I* 2 hidden slabs with direct input-output links */ 

"Slab 0:",16,0 /* Input slab: 16 nodes */ 

"Slab /♦ Hidden slab 1: 11 nodes with Gaussian 

Activation */ 

"Slab 2:**aiJ /» Hidden slab 2: 11 nodes with Gaussian 
Compl. Activation */ 

"Slab 3:^.12 /* Output node: 1 node with Logistic Activa- 
tion */ 

"Slab 4:",1,0/* Unused V 

••Random seed:**, 1,1 /* For initialuing link weights */ 
"Initial weights range:** .03 

"Learn ratc:**,0.1 /* Parameter in backprop algorithm */ 
"Momentum:*" .0.1 /♦ Parameter in backprop algorithm */ 
"Pattern selection i*".© /* Random pattern selection each 
epoch */ 

"Missing values:** .4 /* Report error on missing values */ 
'•Weight update:",! ^ Use momentum-based weight update 
♦/ 

"Net perfect interval: **.20 /* testing interval */ 
"Stop when learn epochs>:*'.1^50 /* criterion for stopping 
training */ 

"Stop when average error<:**. 1,0.001 /* criterion for stop- 
ping training */ 

"Stop when events since . . . >:",1. 20000 /♦ criterion for 
stopping training */ 

"Stop when largest erroa*<:**,O.O.l /♦ not used */ 

"Apply to train:**, I /* after training, apply net to training set 
*/ 

"y^ply to tcst:**,l /♦ after training, apply net to test set ♦/ 
'•Apply to production !:**,! /* after training, apply net to 

validation set */ 
"Apply to production2:**.l /* after training, app4y net to 

production set */ 
"Automatic setting of hidden neurons: **. 0 /* Set no, of 

hidden nodes manually */ 
"Min/max calculation method:**,0 /* use only training data 

for min/max calc ♦/ 

"Rem Control block to work in unix — " 

"Rem Operation: -l=no operation. 0=start over. l=continue 

existing. 2=^pply net 
"Operation :**,0 /* re- start training from scratch every week 

V 

What is claimed Is: 

1. A system comprising one or more computers for 
selecting a portfolio of securities expected to provide, within 
a predetermined period of time, a return superior to a 
risk-adjusted performance of a selected market index, the 
system comprising: 
a plurality of predictive neural network means for esti- 
mating an appreciation potential of participating secu- 
rities of a capital market over the period of time, each 
of the neural network means estimating the apprecia- 
tion potential of a separate individual security of the 
participating securities, wherein each of the neural 
network means has been trained using data relating to 
a corresponding security so as to adjust weights of each 
of the neural network means; 
portfc^o construction means for selecting an investment 
portfolio 00 the basis of the estimated appreciation 
potential of the participatijig securities, determined by 
the plurality of the predictive neural networks, such 
that the expected return on the selected investment 
portfolio over the predetermined period of time is 
superior to the performance of the selected index; and 
means for receiving infonnation at>out the participating 
securities. 
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2. Ttie system of claim 1 wherein the appreciation potcn- using a plurality of predictive neural network means, 
tiaJ of each security is a parameter which is related to the wherein each of the predictive neural network means 
expected excess return of the security over the risk-adjusted estimating appreciation potential of a separate indi- 
index return of the capital market. vidual security of the participating securities, wherein 

3. The system of claim 2 further comprising: 5 each of the neural network means has been trained 
means for computing a tracking oror indicative of the using data relating to a correspondiDg security so as to 

deviation of the determined performance of the invest- adjust weights of each of the neural network means; 

ment portfolio from the risk-adjusted index value the selecting an optimized investment portfolio on the basis 

capital market; and of the estimated appreciation potential of the partici- 

means for rebalancing the investment portfolio on the pating securities, determined by the plurality of the 

basis of the input from the means for computing. neural network means, such that the expected return on 

4. The system of claim 2 wherein the portfolio optimiza- the selected investment portfolio over the predeter- 
tion means comprises: rained period of time is superior to the performance of 

storage means for storing data for earfi security in the ^ selected index; and 

capital market; and receiving information on each security of the selected 

optimization means for selecting an optimal portfolio on investment portfolio, 

the basis of the input from the storage means; wherein ^f claim 11 wherein the ^prcdation 

the optimization means operate in accordance witii an potential of each security is a parameter related to the 
optimization function using a set of constraints on the ^ expected excess return of the security, over the index return 

estimated ^predation potential of each security, risk capital market 

factors and investor preferences. The method of daim 12 further comprising the st^s 

5. The system of claim 2 further comprising means for 

expressing the appredation potential for eadi security in computing a tracking error indicative of the deviation of 

both U.S. currency and the local currency of the coital ^, the determined perfcffmance of the investment portfolio 

market from the index value of the capital market; and 

6. The system of claim 3 wherein the means f(x rebal- rebalancing the investment portfolio on the basis of the 
ancing of the investment portfolio comprises means for computed tracking error. 

generating a list of the security holdings of the p<vtfolio 14. The method of claim 12 wherein the step of selecting 
arranged in a descending order of the estimated appreciation ^ further comprises the steps of: 

potential. storing data related to each security in the capital market; 

7. The system of claim 2 wherein the neural network and 

means operates on the basis of input data associated with optimizing the investment portfolio on the basis of an 

eadi security of the portfoUo relates, whidi input data optimization function which uses a set of constraints on 

comprises: a set of fundamental financial diaractcnsUcs for estimated appredation potential of each security, 

the security, a sequence of price returns and trade volume ^5,^ f^^^^ investor preferences, 

statistics for the security, and a sequence of price returns of 15 i^e metiiod of claim 12 further comprising the step of 

the index assodated with the capital nurket. exi^essing the appredation potential for each security in 

8. The system of claim 7 wherein the neural networic y §. currency and the local cuuency of the capital 
means comprises: an output node for outputting a signal ^ market 

related to the estimated apfffcdation potential of a security, jy^ tneOiod of daim 13 wherein the step of rebal- 

a plurality of input nodes for receiving input data, and a anting the investment portfoUo comprises (he step of gen- 

plurality of intermediate nodes connected between the input ^^^^ ^ security holdings of the portfoUo 

nodes and the ouQ)ut node and having links with weight arranged in a descending order of the estimated ^predation 

values being changed by using a rq)ctitivc weight adjust- potential 

ment based on the evaluation of the output from the output 17 ^^^^ c^^im 12 wherein the step of estimating 

c comfdses the st^ of providing to the network means input 

9. The system of claim 8 further con^sing correction ^ associated with each security of the portfoUo, which 
means for evaluating the signal received from the output i^^^ ^ comprises: a set of ftmdamcntal financial char- 
node, the COTection means outputting a signal for dianging ^ acteristics for the security, a sequence ofprice returns fOT the 
the weight values of Ae Unks when it is detennined that a security sampled at a second Umitcd period of time, and a 
correction is required, sequence of price returns of the index associated with the 

10. The system of daim 7 wherein the neural networic ^^^^ sampled at the second Umited period of time, 
means compnses a neural nctw<»k for each secunty of the ^ system comprising one or more computers for 
capital market and the input to each neural network further ^^j^^g 3 p^rtf^,^^, securities whidi is expected to 
compnses: provide, within a predetermined period of time, a return 

a parameter p related to the systemic risk of the security superior to the risk-adjusted performance of a selected 

with respect to the underlying Index associated with the market index, coursing: 

capital mark^; a plurality of predictive neural netwcyks for estimating an 

a parameter p^/^ assodated with the change of the p $0 appreciation potential of participating securities of a 

parameter over the second limited period of time. capital market over the period of time, wherein each of 

11. A method for administBring an investment portfolio of the neural networks is estimating the appreciation 
securities expected to provide, within a predetermined lim- potential of a separate individual security of the par- 
ited period of time, a return superior to performance of a ticipating securities, wherein each of the neural net- 
selected market index, the method con^rising the steps of: 63 works has l>een trained using data relating to a corre- 

estimating an appreciation potential of participating secu- sponding security so as to adjust weights of each of the 

rities of a capital market over the limited period of time neural networks; 
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at least one input data prq>roccssor for geaerating data fof 
each of the plurality of the neural networks, compris- 
ing: 

first means for receiving for each of the participating 
securities of the capital market current and historical 
trading infonnation; ' 

second means for receiving for each of the participating 
securities of the capital market fundamental infor- 
mation about the security, such fundamental infor- 
mation comprising earnings related data; and 

at least one processor for constructing for each of die lO 
participating securities of the coital market a set of 
input data sequences generated from the capital 
market current and historical trade infotnation and 
fundamental inftrmation. said data sequences being 
supplied to the the plurality of predictive neural 15 
networks for forming an estimate of the appreciation 
potential of each security over the predetermined 
period of time, wherein each of the neural networks 
estimates the appreciation potential of a separate 
individual security of the participating securities. 

19. The input data preprocessor of claim 18 further 
comprising means for receiving index information related to 
the return of the capital market and second storage means for 
storing said input data sequences. 

20. The input data preprocessor of claim 19 wherein at 
least one data sequence includes values related to the BCTA ^ 
parameter associated with the cc^esponding security. 

21. The input data preprocessor of claim 20 wherein at 
least one data sequence includes values related to the 
ALPHA parameter associated with the coiresponding secu- 
rity. 



22. The input data preprocessor of claim 20 further 
comprising means for estimating for each security of the 
capital market the correlation between each input data 
sequence and the investment return of the corresponding 
security. 

23. A method for computerized selection of a portfolio of 
securities expected to provide, within a predetermined 
period of time, a return superior to the risk-adjusted perfor- 
mance of a select market Index, comprising the steps of: 

for each participating security of a capital market elec- 
tronically acquiring fundamental and price related 
infonmation; 

electronically analyzing the acquired information for each 
participating security using a separate predictive neural 
network which has been trained using data relating to 
a corresponding security so as to adjust weights of the 
neural network, to estimate the appreciation potential 
of each participating security relative to the select 
market index; and 

selecting an investment portfolio on the basis of the 
estimated appreciation potentials for Che |>articipating 
securities of the coital market such that the expected 
return on the selected investment portfolio is superior 
to the performance of the select market index. 
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